LLM 25-Day Course - Day 10: Open-Source LLM Ecosystem Overview

Beyond Llama and Mistral, a wide range of open-source LLMs now compete in the space. Understanding each model's characteristics helps you choose the best model for your project.

Global Open-Source LLM Comparison

| Model | Developer | Size | Strengths | License |
|---|---|---|---|---|
| Qwen 2.5 | Alibaba | 0.5B–72B | Multilingual, coding, math | Apache 2.0 |
| DeepSeek V2 | DeepSeek | 236B (21B active) | MoE, cost-effective | MIT |
| DeepSeek-Coder V2 | DeepSeek | 236B | Code-specialized, 338 languages | MIT |
| Phi-3 | Microsoft | 3.8B / 7B / 14B | Ultra-small, high performance | MIT |
| Yi-1.5 | 01.AI | 6B / 9B / 34B | Strong in Chinese + English | Apache 2.0 |
| Command R+ | Cohere | 104B | RAG-specialized, multilingual | CC-BY-NC |
| Falcon 2 | TII (UAE) | 11B | Strong in Arabic | Apache 2.0 |
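
The License column matters in practice: Apache 2.0 and MIT permit commercial use, while CC-BY-NC does not. A minimal sketch of filtering the table above by license (the model-to-license mapping mirrors the table; treating the licenses as a simple binary is a simplification, not legal advice):

```python
# Models from the comparison table above, keyed by license.
MODELS = {
    "Qwen 2.5": "Apache 2.0",
    "DeepSeek V2": "MIT",
    "DeepSeek-Coder V2": "MIT",
    "Phi-3": "MIT",
    "Yi-1.5": "Apache 2.0",
    "Command R+": "CC-BY-NC",
    "Falcon 2": "Apache 2.0",
}

# Apache 2.0 and MIT allow commercial deployment; CC-BY-NC does not.
COMMERCIAL_LICENSES = {"Apache 2.0", "MIT"}

def commercially_usable(models: dict) -> list:
    """Return the models whose license allows commercial use."""
    return [name for name, lic in models.items() if lic in COMMERCIAL_LICENSES]

print(commercially_usable(MODELS))  # Command R+ is excluded
```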

Korean-Specialized Models

| Model | Developer | Size | Features |
|---|---|---|---|
| SOLAR | Upstage | 10.7B | DUS (Depth Up-Scaling) technique |
| EXAONE 3.0 | LG AI Research | 7.8B | Korean + English bilingual |
| HyperCLOVA X | NAVER | Undisclosed | Best Korean performance, proprietary |
| Polyglot-Ko | EleutherAI | 1.3B–12.8B | Korean pre-trained |
| KoAlpaca | Community | Various | Alpaca Korean fine-tuning |
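
SOLAR's DUS (Depth Up-Scaling) builds a deeper model by duplicating a smaller base and splicing the copies together: reportedly, two copies of a 32-layer base each drop 8 overlapping layers, yielding a 48-layer model that is then continually pre-trained. A toy sketch of just the layer arithmetic (layer names are placeholders; the continued pre-training step is omitted):

```python
def depth_up_scale(layers: list, drop: int = 8) -> list:
    """Sketch of DUS: duplicate the layer stack, drop the last `drop`
    layers from the first copy and the first `drop` layers from the
    second copy, then concatenate into one deeper stack."""
    first = layers[:-drop]   # copy 1 without its final `drop` layers
    second = layers[drop:]   # copy 2 without its initial `drop` layers
    return first + second

base = [f"layer_{i}" for i in range(32)]  # e.g. a 32-layer 7B base model
scaled = depth_up_scale(base)
print(len(base), "->", len(scaled))  # 32 -> 48
```

Note how the middle layers overlap (layer_8 through layer_23 appear twice), which is what makes the spliced model a reasonable starting point for further pre-training rather than a random initialization.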

Model Selection Guide

def recommend_model(task, budget, korean_priority):
    """Recommend a model based on task and conditions"""
    recommendations = {
        ("coding", "low", False): "DeepSeek-Coder-V2-Lite (16B)",
        ("coding", "high", False): "DeepSeek-Coder-V2 (236B)",
        ("general", "low", False): "Phi-3 Mini (3.8B)",
        ("general", "medium", False): "Qwen 2.5 72B",
        ("general", "low", True): "EXAONE 3.0 7.8B",
        ("general", "medium", True): "SOLAR 10.7B + Korean fine-tuning",
        ("general", "high", True): "Qwen 2.5 72B (excellent Korean performance)",
        ("math", "low", False): "Qwen 2.5 Math 7B",
        ("math", "high", False): "DeepSeek-Math 7B",
        ("rag", "medium", False): "Command R+ (104B)",
    }

    key = (task, budget, korean_priority)
    return recommendations.get(key, "Qwen 2.5 or Llama 3.1 recommended")

# Usage examples
print(recommend_model("coding", "low", False))
print(recommend_model("general", "low", True))
print(recommend_model("general", "medium", False))

Qwen 2.5 Execution Example

# Ollama: ollama pull qwen2.5:7b
import ollama

response = ollama.chat(
    model="qwen2.5:7b",
    messages=[
        {"role": "system", "content": "Please answer in English."},
        {"role": "user", "content": "Write a generator that produces the Fibonacci sequence."},
    ],
)
print(response["message"]["content"])

# HuggingFace alternative (larger download; needs a GPU with enough memory)
# from transformers import AutoModelForCausalLM, AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
# model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-7B-Instruct")

Korean Model Comparison Experiment

import ollama

# Korean performance comparison test
korean_prompts = [
    "Describe the characteristics of Korea's four seasons in one sentence each.",
    "Explain the meaning of the Korean proverb 'If the words you speak are kind, the words you hear will be kind too.'",
    "Briefly explain the civil service examination system of the Joseon Dynasty.",
]

models_to_test = ["llama3.1:8b", "qwen2.5:7b", "gemma2:9b"]

for prompt in korean_prompts:
    print(f"\nQuestion: {prompt}")
    print("=" * 60)
    for model_name in models_to_test:
        try:
            response = ollama.chat(
                model=model_name,
                messages=[{"role": "user", "content": f"Please answer in Korean. {prompt}"}],
            )
            answer = response["message"]["content"][:150]
            print(f"  [{model_name}] {answer}...")
        except Exception:
            print(f"  [{model_name}] Model not installed")

Open-Source LLM Trends

| Trend | Description |
|---|---|
| Small model dominance | Models like Phi-3 and Gemma 2 at 3–9B achieving 13B-level performance |
| MoE expansion | Growing number of efficient MoE models like DeepSeek, Mixtral |
| Code specialization | Code-specific models like DeepSeek-Coder, CodeLlama |
| Multilingual enhancement | Improved non-English performance in Qwen, EXAONE, etc. |
| License relaxation | Expanding commercially usable licenses like Apache 2.0, MIT |
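
The MoE trend is why DeepSeek V2 can have 236B total parameters but activate only about 21B per token: a gating network routes each token to a small top-k subset of experts. A minimal sketch of top-k gating (the logit values are illustrative, not taken from any real model):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of logits."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def route_top_k(gate_logits, k=2):
    """Pick the top-k experts for one token and renormalize
    their gate weights so they sum to 1."""
    probs = softmax(gate_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    total = sum(probs[i] for i in top)
    return [(i, probs[i] / total) for i in top]

# One token's gate scores over 8 experts (illustrative values)
gate_logits = [0.2, 1.5, -0.3, 2.1, 0.0, -1.2, 0.7, 0.4]
for expert, weight in route_top_k(gate_logits, k=2):
    print(f"expert {expert}: weight {weight:.2f}")
```

Only the selected experts' feed-forward weights run for this token, which is how total parameter count and per-token compute decouple.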

Today’s Exercises

  1. Install Qwen 2.5 7B via Ollama and ask it 3 coding questions. Compare the answers with Llama 3.1 8B and evaluate which model performs better.
  2. Research what SOLAR’s DUS (Depth Up-Scaling) technique is and compare it with conventional model scaling methods.
  3. Select the most suitable open-source model for your project and document the reasoning. (Consider task, budget, hardware, and language requirements)
